arXiv:1805.01753v2  [quant-ph]  27  Feb  2019 


Probability  in  two  deterministic  universes 


Mateus  Araujo 

Institute  for  Theoretical  Physics,  University  of  Cologne,  Ztilpicher  Strafe  77, 
50937  Cologne,  Germany 

28th  February  2019 


How  can  probabilities  make  sense  in  a  deterministic  many-worlds  the¬ 
ory?  We  address  two  facets  of  this  problem:  why  should  rational  agents 
assign  subjective  probabilities  to  branching  events,  and  why  should  branch¬ 
ing  events  happen  with  relative  frequencies  matching  their  objective  prob¬ 
abilities. 

To  address  the  first  question,  we  generalise  the  Deutsch-Wallace  theorem 
to  a  wide  class  of  many-world  theories,  and  show  that  the  subjective  prob¬ 
abilities  are  given  by  a  norm  that  depends  on  the  dynamics  of  the  theory: 
the  2-norm  in  the  usual  Many-Worlds  interpretation  of  quantum  mechan¬ 
ics,  and  the  1-norm  in  a  classical  many-worlds  theory  known  as  Kent's 
universe.  To  address  the  second  question,  we  show  that  if  one  takes  the 
objective  probability  of  an  event  to  be  the  proportion  of  worlds  in  which 
this  event  is  realised,  then  in  most  worlds  the  relative  frequencies  will  ap¬ 
proximate  well  the  objective  probabilities.  This  suggests  that  the  task  of 
determining  the  objective  probabilities  in  a  many-worlds  theory  reduces  to 
the  task  of  determining  how  to  assign  a  measure  to  the  worlds. 

We  are  used  to  think  of  a  probabilistic  situation  as  one  where  either  an  event  E  or  an 
event  -iE  happens,  with  probabilities  p  and  1  —  p.  In  many-world  theories,  however, 
probabilistic  situations  are  treated  as  deterministic  branching  situations,  where  some 
worlds  are  created  where  event  E  happens,  and  some  worlds  are  created  where  event 
-i E  happens.  Does  it  still  make  sense  to  assign  probabilities  p  and  1  —  p  to  the  worlds 
with  events  E  and  -t E ? 

This  question  was  already  raised  at  the  inception  of  the  Many- Worlds  interpretation 
by  Everett  in  1957  [1].  Several  early  attempts  were  made  to  understand  probability 
from  a  frequentist  point  of  view  [2-6]  that  were  mathematically  mistaken  [7,8].  Pro¬ 
gress  had  to  wait  until  1999,  when  Deutsch  proposed  a  proof  of  the  Born  rule  from 
decision-theoretical  assumptions,  that  was  subsequently  clarified  and  improved  upon 
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by  Wallace  [9-12].  In  Deutsch  and  Wallace's  proofs  probabilities  are  understood  as 
tools  rational  agents  use  to  make  decisions  about  branching  situations,  in  analogy  to 
single-world  decision  theory. 

Other  successful  derivations  of  the  Born  rule  in  the  Many- Worlds  interpretation  have 
been  proposed  since  then.  Zurek  proposed  a  particularly  simple  one,  that  however 
does  not  address  the  question  of  what  probabilities  should  mean  in  a  many-worlds 
theory  [1  ].  More  recently  Vaidman,  Carroll,  and  Sebens  [14, 15]  provided  proofs 
that  do  offer  such  an  explanation:  in  their  argument  probabilities  are  a  measure  of 
the  lack  of  knowledge  an  agent  has  about  in  which  world  they  are,  at  the  instant  of 
time  after  they  performed  a  measurement  but  before  they  learned  the  outcome.  This 
explanation  is  not  satisfactory,  however,  as  probabilities  must  also  make  sense  before 
a  measurement  is  performed,  where  no  uncertainty  arises. 

In  this  paper,  we  investigate  the  probability  problem  by  focussing  on  general  many- 
world  theories  that  have  a  measurement-like  branching  situation,  where  a  finite  num¬ 
ber  of  possible  outcomes  can  occur,  and  one  set  of  worlds  is  created  with  each  outcome. 
Each  set  of  worlds  has  a  coefficient  associated  to  them,  with  the  only  constraint  that  if 
the  coefficient  is  zero,  then  no  worlds  exist  with  that  outcome. 

Two  concrete  many-world  theories  that  fit  into  this  framework  are  the  usual  Many- 
Worlds  interpretation,  where  the  coefficients  are  the  usual  complex  amplitudes,  and 
a  classical  many-worlds  theory  introduced  by  Adrian  Kent  in  Ref.  [16].  There,  agents 
live  in  a  deterministic  computer  simulation,  and  the  coefficients  are  non-negative  in¬ 
tegers  that  literally  indicate  the  number  of  copies  of  the  simulation  that  are  run  with 
each  outcome  after  branching.  We  generalise  the  Deutsch-Wallace  theorem  to  these 
many-world  theories,  and  show  that  it  implies,  in  Kent's  universe,  that  the  subjective 
probabilities  must  come  from  the  1-norm  of  the  coefficients,  and  in  Many-Worlds  from 
the  2-norm. 

The  decision-theoretical  approach  has  been  criticized  by  several  authors  [16-22]. 
Besides  objections  against  specific  aspects  of  Deutsch's  and  Wallace's  proofs  that  do 
not  apply  to  the  current  work,  the  criticism  focuses  on  three  main  points:  (i)  the  claim 
that  to  derive  the  branching  structure  of  Many-Worlds  via  decoherence  one  needs 
to  assume  the  Born  rule  in  the  first  place,  which  would  make  the  Deutsch-Wallace 
argument  circular,  (ii)  the  observation  that  there  are  irrational  agents  who  do  not  use 
probabilities  to  make  their  decisions,  and  (iii)  the  claim  that  it  is  incoherent  to  use 
decision  theory  to  derive  probabilities  that  are  actually  objective.  With  regards  to  (i), 
it  is  not  true  that  the  derivation  of  the  branching  structure  depends  on  the  Born  rule. 
It  depends  only  on  being  able  to  say  that  quantum  states  that  are  close  in  some  metric 
give  rise  to  similar  physics,  which  is  standard  scientific  practice  [23].  In  any  case, 
in  this  paper  we  are  not  concerned  with  deriving  the  branching  structure,  but  rather 
with  making  sense  of  probabilities  given  that  such  a  branching  structure  exists.  With 
regards  to  (ii)  and  (iii),  one  should  note  that  these  criticisms  apply  equally  well  to 
many-world  and  single-world  versions  of  decision  theory,  so  they  are  actually  against 
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the  whole  idea  of  subjective  probability.  To  a  subjectivist  they  ring  hollow,  as  for  them 
probability  is  nothing  but  a  tool  used  by  rational  agents.  Nevertheless,  it  is  hard  to 
deny  that  there  is  something  objective  about  quantum  probabilities. 

To  address  this  point,  we  show  that  for  any  many-worlds  theory  where  a  fairly  gen¬ 
eral  measure  can  be  assigned  to  the  sets  of  worlds  with  a  given  outcome,  an  analogue 
of  the  law  of  large  number  can  be  proved,  which  says  that  in  most  worlds  the  relative 
frequency  of  some  outcome  will  be  close  to  the  proportion  of  worlds  with  that  out¬ 
come.  This  suggests  that  we  should  define  the  objective  probability  of  an  outcome  as 
the  proportion  of  worlds  with  that  outcome.  While  in  Kent's  universe  it  is  obvious 
how  to  calculate  the  proportion  of  worlds,  and  thus  the  objective  probabilities,  this  is 
not  the  case  in  Many-Worlds.  There,  the  best  this  argument  can  do  is  say  that  one 
should  take  the  relative  frequencies  as  a  guess  for  the  proportion  of  worlds,  as  in  most 
worlds  one  will  be  right. 

We  also  explore  a  non-symmetric  branching  scenario  where  Kent's  universe  and 
Many-Worlds  assign  fundamentally  different  proportions  of  worlds  to  each  outcome, 
and  thus  the  analogy  between  these  many-world  theories  breaks  down.  We  argue  that 
this  is  due  to  the  fact  that  the  number  of  worlds  in  Kent's  universe  increases  upon 
branching,  while  in  Many-Worlds  the  measure  of  worlds  is  conserved.  To  support  this 
argument  we  introduce  a  reversed  version  of  Kent's  universe  where  branching  con¬ 
serves  the  number  of  worlds,  and  show  that  it  assign  proportions  of  worlds  compatible 
with  Many-Worlds.  This  breakdown  suggests  that  one  of  the  key  rationality  principles 
that  Deutsch  and  Wallace  used  in  their  proofs  of  the  Born  rule  -  called  substitutibility 
by  Deutsch  and  diachronic  consistency  by  Wallace  -  is  not  valid  in  Kent's  universe, 
and  therefore  shouldn't  be  accepted  for  every  many-worlds  theory.  The  proof  of  the 
generalised  Deutsch-Wallace  theorem  presented  here  does  not  assume  this  principle, 
but  instead  derives  it  as  a  theorem  in  Many-Worlds  and  the  reverse  Kent's  universe. 


i.  Naive  decision  theory 

The  generalised  many-world  theories  we  shall  be  concerned  with  have  very  little  struc¬ 
ture.  We  only  need  them  to  admit  a  measurement-like  branching  situation  with  n 
possible  outcomes,  where  after  branching  n  sets  of  worlds  are  created,  one  with  each 
outcome.  Each  outcome  i  has  a  complex  coefficient  c,  associated  to  it  whose  precise 
meaning  depends  on  the  physics  underlying  the  many-worlds  theory,  but  obeys  the 
general  constraint  that  if  the  coefficient  is  equal  to  zero,  then  no  worlds  are  created 
with  this  outcome. 

We  want  to  use  this  branching  situation  to  play  a  game  where  an  agent  receives  a 
reward  r,  in  the  worlds  that  are  created  with  outcome  i.  The  game  is  then  defined  by 
a  vector  of  coefficients  c  €  C",  and  a  vector  of  rewards  r  E  IR",  and  can  be  represented 
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by  the  n  x  2  matrix 


G  =  (c,r) 


Ai  r\ 

\pn  Y  n 


to 


where  line  i  specifies  the  coefficient  and  reward  associated  to  the  worlds  with  out¬ 
come  i. 

We  are  concerned  with  the  price  at  which  a  rational  agent  with  full  knowledge  about 
the  situation  would  accept  to  buy  or  sell  a  ticket  for  playing  a  game  G.  This  fair  price 
is  called  the  value  of  the  game,  and  is  denoted  by  V (G).  From  this  price  we  shall  infer 
subjective  probabilities  that  are  attributed  by  the  agent  to  the  outcomes.  To  determine 
this  fair  price  we  shall  use  a  fairly  basic  decision  theory,  taken  from  Ref.  [24],  that  is 
essentially  a  formalisation  of  the  Dutch  book  argument. 

The  first  rationality  axiom  we  demand  is  that  if  in  all  worlds  the  game  gives  the 
same  reward  r\,  the  agent  must  assign  value  T\  to  the  game.  If  the  value  were  higher, 
the  agent  would  lose  money  in  all  worlds  when  buying  a  ticket  for  this  game;  if  it 
were  lower,  the  agent  would  lose  money  in  all  worlds  when  selling  a  ticket  for  this 
game.  This  axiom  implies  that  the  agent  is  indifferent  to  branching  per  se,  and  only 
cares  about  the  rewards  their  future  selves  get.  This  might  be  false,  as  we  can  easily 
conceive  of  an  agent  which  is  reluctant  about  branching,  and  would  only  accept  to  pay 
a  value  smaller  than  n  to  play  this  game.  In  this  case  the  agent  would  also  be  reluctant 
to  sell  a  ticket  for  this  game,  and  demand  a  price  higher  than  r\,  so  there  is  no  fair 
price  that  can  be  agreed  upon,  and  presumably  the  game  simply  won't  be  played.  We 
deem  this  behaviour  to  be  irrational.  This  is  specially  so  in  theories  where  branching 
is  ubiquitous  and  unavoidable,  such  as  Many-Worlds. 

The  axiom  is  then 


•  Constancy  If  for  some  game  G  =  (c,  r)  all  rewards  r,  are  equal  to  r\,  then  V  (G)  =  r\. 

The  second  axiom  we  demand  is  that  if  for  a  pair  of  games  with  equal  coefficients  G 
and  G'  the  rewards  of  the  first  game  are  larger  than  or  equal  to  the  rewards  of  the 
second  game  in  all  worlds,  then  the  agent  must  value  the  first  game  no  less  than  the 
second  game.  This  could  be  false,  for  example,  if  some  agent  actively  wants  to  hurt 
their  future  selves  that  receive  rewards  smaller  than  the  maximal  one,  and  accepts  to 
pay  more  for  a  game  where  this  happens.  We  deem  this  behaviour  to  be  irrational. 
The  axiom  is  then1 

•  Dominance  Let  G  =  (c,r)  and  G'  =  (c,  r')  he  two  games  that  differ  only  in  their 
rewards.  If  r  >  r',  then  V(G)  >  V'(G'). 


1  Dominance  will  only  be  needed  to  prove  Theorem  i,  that  says  that  rational  agents  in  a  many- worlds 
theory  assign  subjective  probabilities  to  the  worlds.  All  other  results  follow  without  it,  including 
Theorem  3  and  Theorem  4,  that  say  that  rational  agents  in  Kent's  universe  and  Many-Worlds  bet 
according  to  the  appropriate  version  of  the  Born  rule. 
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The  third  axiom  we  need  is  that  it  shouldn't  matter  if  a  ticket  for  a  game  with  rewards 
r  +  r'  is  sold  at  once,  or  broken  down  into  first  a  ticket  for  the  a  game  with  rewards  r 
followed  by  a  ticket  for  another  game  with  rewards  r7,  where  these  games  are  played 
using  the  same  branching  situation.  If  the  price  of  the  composed  ticket  were  higher 
than  the  sum  of  the  values  of  the  individual  tickets,  the  agent  would  lose  money  in 
all  worlds  when  buying  the  composed  ticket  and  selling  the  individual  tickets.  On 
the  other  hand,  if  the  price  of  the  composed  ticket  were  lower,  the  agent  would  lose 
money  in  all  worlds  when  buying  both  individual  tickets  and  selling  the  composed 
ticket.  The  axiom  is  then 

•  Additivity  Let  G  =  (c,r)  and  G'  =  (c,  r')  be  two  games  that  differ  only  in  their 
rewards.  Then  the  game  G"  =  (c,  r  +  r')  has  value  V(G")  =  V(G)  +  V(G'). 

This  Additivity  axiom  is  eminently  reasonable  in  the  scenario  considered,  where  the 
agent  should  be  willing  to  act  as  bookie  and  bettor  in  the  game.  In  decision  theory, 
however,  one  usually  considers  scenarios  where  an  agent  is  simply  offered  the  gamble 
and  is  trying  to  decide  how  much  to  pay  for  it,  without  needing  to  act  as  a  bookie.  In 
this  case  it  is  not  irrational  to  set  V  (c,  r  +  r')  <  V  (c,  r)  +  V  (c,  r');  it  is  in  fact  necessary 
to  avoid  pathological  decisions  such  as  Pascal's  Wager  and  the  St.  Petersburg  para¬ 
dox.  As  shown  by  Wallace,  this  scenario  can  be  dealt  with  using  a  more  sophisticated 
decision  theory,  such  as  Savage's  [12,25].  But  we  believe  that  such  decision-theoretical 
sophistication  distracts  from  the  physical  arguments  at  hand,  and  will  stick  with  the 
simpler  bettor/bookie  scenario.  One  should  note,  anyway,  that  even  in  Savage's  the¬ 
ory  Additivity  is  a  good  approximation  when  the  values  being  gambled  are  small 
compared  to  the  bettor's  total  wealth,  and  one  could  constrain  the  analysis  to  games 
in  which  this  condition  is  satisfied. 

The  fourth  and  last  axiom  we  need  from  classical  decision  theory  is  that  the  value 
function  must  be  continuous  on  the  coefficients  and  rewards: 

•  Continuity  Let  (Gjt)  be  a  sequence 2  of  games  such  that  lim^oo  G/c  =  G.  Then  V  (G)  = 
limit-**  V(Gic) 

It  is  here  just  for  mathematical  convenience.  It  can  be  left  out  entirely  if  one  is  happy  to 
restrict  the  rewards  to  be  rational  numbers  (as  more  realistically  they  would  be  integer 
multiples  of  eurocents)  and  the  coefficients  to  belong  to  some  countable  subset  of  the 
complex  numbers  that  depends  on  the  particular  many-worlds  theory  (as  it  would  be 
physically  suspect  to  demand  them  to  be  specified  with  infinite  precision). 

The  axioms  presented  up  to  this  point  are  already  enough  to  imply  that  the  agent 
must  assign  subjective  probabilities  to  the  outcomes  of  the  measurement  in  the  game. 
The  proof  is  an  elementary  exercise  in  decision  theory,  but  we  shall  include  it  here 
anyway  because  it  is  short,  enlightening,  and  mostly  unfamiliar  to  physicists: 


2For  concreteness,  convergence  is  defined  in  the  metric  induced  by  the  norm  ||G||  :=  ||c||3  +  ||r||i. 
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Theorem  1.  Let  G  =  (c,  r)  be  a  game  with  n  outcomes.  A  rational  agent  must  assign  it  value 

V(G)  =  'EiV(c,ei)ri,  (2) 

i= 1 

where  the  vectors  e;  are  f/ze  standard  basis,  and  V (c,  e,)  are  f/ze  subjective  probabilities,  as 

V(c,e,-)>0  and  ^V(c,e;)=l  (3) 

i=  1 

Proof.  By  Additivity 

V(c,r)  =  f^V(c,riei),  (4) 

i=l 

so  we  only  need  to  compute  the  value  of  games  with  a  single  non-zero  reward,  e.g. 
(c,fiei),  upon  which  we  shall  now  focus.  Using  Additivity  again,  we  see  that  for  any 
positive  integer  b  we  have  that 


V  (c,  br\e\)  =  bV(c,nei),  (5) 

and  setting  r\  =  1/b  shows  us  that 

V(C,lei)  =  lV(c'ei^  (6) 

Another  application  of  Additivity  lets  us  conclude  that  for  any  positive  integer  a  we 
have  that 

V(C'lei)  =  Hy(C'ei)'  (7) 

so  this  homogeneity  is  valid  for  any  positive  rational  number.  To  extend  it  for  any 
rational  number  q  we  use  Constancy  to  conclude  that  V(c,0)  =  0,  where  0  is  the 
vector  of  all  zeroes,  and  Additivity  again  to  show  that 

0  =  V(c,0)  =  V(c,</ei)  +  V(c,— <jei),  (8) 

and  thefore  that 

V(c,  -qex)  =  -V(c,qei).  (9) 

To  extend  this  to  any  real  number  p,  let  (qf)  be  a  sequence  of  rational  numbers  con¬ 
verging  to  it.  Then  by  Continuity 

V (c, //ei)  =  lim  V(c,^ei)  =  lim  qkV( c,ei)  =  pV(c,ei),  (10) 

k—>  00  k— >00 

and  therefore  for  any  reward  vector  r  we  have  that 

7(c,r )  =  £V,U(c,e;),  (11) 

i= 1 
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so  the  computation  of  the  value  of  an  arbitrary  game  reduces  to  the  computation  of 
the  values  of  the  elementary  games  (c,  e,),  which  are  by  definition  the  subjective  prob¬ 
abilities.  To  see  that  they  are  positive  and  normalised  first  notice  that  by  Dominance 

V (C/  e;)  >  V (c,  0)  =  0,  (12) 

and  that  by  Constancy 

V(c,l)  =  l  =  £v(c  :,ef),  (13) 

i=l 

where  1  is  the  vector  of  all  ones.  □ 

This  is  as  far  as  we  can  go  by  using  the  naive  decision  theory.  Rational  agents  must 
reason  about  the  games  by  assigning  a  probability  to  each  outcome,  but  the  decision 
theory  is  silent  about  what  the  probabilities  must  be. 

One  can  go  further,  though,  by  assuming  an  additional  axiom  reminiscent  of  the 
disreputable3  Principle  of  Indifference  from  classical  probability  [26].  It  states  that  one 
should  assign  uniform  probabilities  to  symmetric  situations:  coins  and  dice  should  be 
regarded  as  equiprobable  because  there  is  no  preferred  side  of  the  coin  or  face  of  the 
die. 

In  a  deterministic  single-world  theory  the  principle  is  nonsense,  as  a  truly  symmet¬ 
ric  situation  would  be  incapable  of  producing  an  output.  Deterministic  single  world 
theories  that  are  capable  of  producing  outcomes,  such  as  Bohmian  mechanics,  must 
have  a  variable  breaking  the  symmetry.  But  a  rational  agent  that  knows  its  value 
should  definitely  not  assign  uniform  probabilities,  but  rather  probability  one  to  the 
outcome  that  will  actually  happen. 

Probabilistic  single-world  theories  fare  a  bit  better,  as  one  can  have  a  perfectly  sym¬ 
metric  situation  before  the  measurement.  But  after  the  measurement  the  symmetry  is 
necessarily  broken,  as  only  one  outcome  actually  happens,  and  a  rational  agent  that 
knows  which  should  definitely  not  assign  uniform  probabilities. 

In  both  deterministic  and  probabilistic  single-world  theories,  therefore,  an  indiffer¬ 
ence  principle  can  only  apply  to  agents  with  restricted  knowledge:  about  the  present, 
in  the  deterministic  case,  or  about  the  future,  in  the  probabilistic  case.  Only  in  many- 
world  theories  can  the  symmetry  remain  rigorously  unbroken,  even  after  the  measure¬ 
ment  is  made,  and  agents  with  full  knowledge  about  the  game  can  apply  the  indiffer¬ 
ence  principle.  Note  that  even  if  the  agent's  future  selves  could  relay  information  to 
the  past  about  which  outcome  they  experience  it  wouldn't  help,  as  the  agent  already 
knows  that  they  will  have  future  selves  experiencing  all  outcomes. 

The  axiom  is  then 


3It  is  untenable  in  classical  probability  because  in  a  given  situation  there  are  often  several  different 
plausible  symmetries  that  give  rise  to  different  probability  assignments.  This  problem  does  not  arise 
here,  as  the  symmetry  at  hand  will  be  the  one  between  the  coefficients  of  the  worlds,  or  the  amplitudes 
of  the  quantum  state. 
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•  Indifference  Let  G  =  (c,r)  and  a  be  a  permutation.  Then  the  game  with  permuted 
coefficients  and  rezvards  G<t  =  (<r(c),<r(r))  has  value  V(Ga)  =  V(G) 

Notice  that  this  axiom  is  not  merely  saying  that  it  does  not  matter  whether  we  write 
down  1  or  2  to  label  the  outcome  of  a  coin  toss,  but  rather  that  it  does  not  matter  if 
we  exchange  the  coefficients  and  the  rewards  of  the  set  of  worlds  with  outcome  1  with 
those  of  the  set  of  worlds  with  outcome  2.  Worlds  1  and  worlds  2  are  physically  dif¬ 
ferent,  if  nothing  else  for  having  different  measurement  results.  Indifference  implies 
that  whatever  these  differences  are,  the  coefficients  and  rewards  are  the  only  relevant 
ones.  This  is  clearly  not  true  if  outcome  1,  and  not  2,  happens. 

This  axiom  suffices  to  prove  that  symmetric  games  have  uniform  probabilities: 

Lemma  z  (Symmetry).  Let  G  =  (c,  r)  be  a  game  with  n  outcomes  such  that  all  coefficients 
Cj  are  equal  to  c\.  Then 

r(G)  =  ;)x>  (14) 

1  i'=l 

Proof.  Let 

Gat  =  (t7i(c),(7i(r))  =  (c,<7;(r))  (15) 

be  a  version  of  G  with  the  zth  cyclic  permutation  applied  to  the  coefficients  and  re¬ 
wards,  so  that  by  Indifference  V ( GLTi )  =  V(G).  If  one  then  defines  the  reward  vector 

n 

p  =  L(rM'  (l6) 

1=1 

all  its  components  are  equal  to  Ya=\  ri>  so  by  Constancy  the  game  T  =  (c,p)  has  value 



i=i 


(17) 


but  also  by  Additivity 

V(T)  =  f>(c  ,crf(r))  =  (G^)  =  nV  (G),  (18) 

t=i  i=i 

and  therefore 

y(G)  =  “£r'-  (19) 

n  1=1 

□ 

Now,  to  deal  with  games  with  unequal  coefficients,  we  shall  do  a  fine-graining 
argument  to  reduce  them  to  symmetric  games.  How  the  fine-graining  works  depends 
on  the  precise  physics  of  the  many-worlds  theory,  so  we'll  have  different  fine-graining 
axioms  for  different  theories. 
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2.  Kent's  universe 


In  Kent's  universe  the  agent  lives  in  a  deterministic  computer  simulation  run  by  some 
advanced  civilization.  The  branching  happens  at  the  press  of  a  button  that  is  displayed 
on  a  wall,  that  in  addition  contains  a  list  of  non-negative  integers  m  =  (mi,. . . ,mn) 
and  real  numbers  r  =  (74, .  ■■  ,r„).  These  integers  play  the  role  of  the  coefficients  in 
the  game,  and  »z,  is  literally  the  number  of  successor  worlds  which  are  created  with 
reward  r,  in  them. 

In  this  case  it  is  clear  how  to  do  fine-graining:  if  one  plays  the  game 


g=G:;)  <*» 

or  the  game 

G'=^l  r2j,  (21) 

in  both  games  three  successor  worlds  are  created,  one  with  reward  r\  in  it,  and  two 
with  reward  r2.  The  only  difference  is  that  while  in  game  G  both  worlds  with  reward 
r2  are  labelled  with  outcome  2,  in  game  G'  one  of  the  worlds  with  reward  r2  is  labelled 
with  outcome  2  and  the  other  with  outcome  3.  We  postulate  that  this  difference  does 
not  matter,  and  that  V(G' )  =  V  (G).  Since  C  is  symmetric,  we  can  evaluate  V(G') 
using  Lemma  2,  and  thus  determine  that 


V(G)  -  X-rx  +  ^r2. 


(22) 


Generalising  this  argument,  we  can  determine  the  value  of  any  game  by  fine-graining 
it  into  a  symmetric  game.  The  formal  postulate  we  need  is 


•  i-Fine-graining  Let 


/mi 

g\ 

m2 

ri 

\mn 

Yn/ 

(23) 


be  a  game  with  n  outcomes.  Then  for  any  non-negative  integers  mu,  niu  such  that 
mu  +  W12  =  nil  the  fine-grained  game  zvith  n  +  1  outcomes 


(mu 

n\ 

mi  2 

G 

G'  = 

m2 

ri 

\mn 

rnJ 

(24) 


has  value  V(G')  =  V(G) 
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Theorem  3.  Let  G  =  (m,  r)  be  a  game.  A  rational  agent  in  Kent's  universe  must  assign  it 
value 

v(g)  =  irir  (25) 

Proof.  Starting  from  an  arbitrary  //-outcome  game  G  =  (m,  r),  we  can  repeatedly  apply 
i-Fine-graining  to  take  it  to  a  symmetric  game  C  =  (l(C=i  m/)/ r'),  where  1  (GU”';)  is 
the  vector  of  £]”=1  m,  ones.  The  reward  vector  r'  consists  of  ///]  rewards  r\,  m2  rewards 
t2,  etc.,  and  can  be  written  as  r'  =  ®"=1  rf  '1 .  Its  value  V'(G')  can  then  by  calculated 
via  Lemma  2,  and  is  given  by 


V(G') 


1  n  mi 

rrr-^.LLri 

Li=i  mi  i=1  j= 1 


mn- 


(26) 


□ 


3.  Many- Worlds 

In  the  Many-worlds  interpretation  a  measurement  is  a  unitary  transformation,  that  via 
decoherence  gives  rise  to  superpositions  of  quasi-classical  worlds.  We  shall  not  discuss 
how  this  happens,  as  this  has  been  extensively  explored  elsewhere  [27-30].  Rather, 
we  shall  take  for  granted  that  this  is  indeed  the  case,  and  that  one  can  instantiate 
the  games  by  doing  measurements  on  quantum  states,  with  the  complex  amplitudes 
playing  the  role  of  the  coefficients. 

The  game  is  started  by  giving  the  agent  a  state 

lJ/,)  =  Eail  i)’  (27) 

1=1 

where  the  |z)  are  distinguishable  states  of  some  infinite-dimensional  degree  of  free¬ 
dom,  such  as  position.  The  agent  then  does  a  measurement  in  the  computational 
basis:  a  unitary  transformation  that  acts  on  the  basis  state  |  i)  together  with  the  meas¬ 
urement  device  in  the  ready  state  |M?)  and  takes  the  measurement  device  to  the  state 
with  outcome  i  called  |M,): 

\i)\M-,)  ^\i)\Mi).  (28) 

The  game  is  concluded  by  giving  the  agent  reward  r,  in  the  worlds  with  outcome  i, 
with  the  whole  process  taking  the  initial  state  Y!i=i  | AT?)  |r?)  to  the  final  state 

lG)  =  £«ilO|M«)ki)-  (29) 

i=i 

Since  the  measurement  is  done  in  a  fixed  basis,  the  game  is  completely  defined  by  the 
vector  of  coefficients  oc  and  the  vector  of  rewards  r,  and  so  we  can  represent  it  by  the 
matrix  G  =  (a,  r). 
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We  want  to  proceed  as  in  Kent's  universe,  and  use  a  fine-graining  postulate  to  re¬ 
duce  arbitrary  games  to  symmetric  games.  It  is  not  so  obvious,  however,  how  to  do 
fine-graining  in  the  Many-Worlds  picture,  as  we  cannot  count  worlds  in  order  to  say 
that  games  are  equivalent  if  they  assign  the  same  number  of  worlds  to  each  reward.  As 
argued  by  Wallace,  the  continuous  nature  of  quantum  mechanics  and  the  arbitrariness 
of  the  border  between  worlds  make  it  impossible  to  count  them  in  a  physically  mean¬ 
ingful  way  [n].  One  could  try,  instead,  to  arbitrarily  postulate  that  there  is  one  world 
for  each  measurement  outcome  with  nonzero  coefficient,  as  is  often  proposed* 4  [  ,31]. 

To  see  that  this  is  untenable,  consider  the  historical  Stern-Gerlach  experiment  [32], 
that  measured  the  spin5  of  silver  atoms  by  letting  them  accumulate  on  a  glass  plate; 
if  some  atom  ended  up  on  the  left  hand  side  of  the  glass  plate  it  had  spin  +ti\, 
and  if  ended  up  on  the  right  hand  side  it  had  spin  —  tij.  Their  apparatus,  however, 
was  not  doing  a  left/right  measurement,  but  rather  a  much  more  precise  position 
measurement,  that  needs  to  be  coarse-grained  in  order  to  obtain  a  two-outcome  spin 
measurement.  Should  we  then  consider  the  large  number  of  distinguishable  positions 
to  be  the  number  of  worlds  produced  in  this  experiment?  But  if  so,  what  would  we 
make  of  a  more  modern  Stern-Gerlach  experiment,  that  instead  of  the  glass  plate  uses 
a  Langmuir-Taylor  detector,  a  single  hot  wire  that  is  scanned  across  the  neutral  atom 
beam  [  53, 34]?  Should  it  be  taken  to  produce  two  worlds,  one  in  which  the  atom  hit 
the  wire  and  another  in  which  it  didn't?  And  what  if  we  do  a  Frankenstein  version  of 
the  Stern-Gerlach  experiment,  with  a  glass  plate  on  the  left-hand  side  and  a  Langmuir- 
Taylor  detector  on  the  right-hand  side?  Does  it  create  more  worlds  with  spin  +  than 
with  spin  —h^7 

This  should  make  it  clear  that  the  number  of  outcomes  in  a  measurement  is  largely 
arbitrary,  and  of  little  relevance.  What  actually  matters  is  that  atoms  with  spin  +ti\ 
go  left  and  atoms  with  spin  —ti\  go  right;  these  different  experimental  setups  are 
equivalent  ways  to  do  the  measurement,  and  we  shall  base  the  fine-graining  argument 
in  Many-Worlds  precisely  on  this  equivalence. 

Consider  then  a  game  where  a  measurement  is  made  on  the  state 

{ip)  =  ot\l)  +  p\2),  (30) 

reward  r\  is  given  in  the  worlds  with  outcome  1,  and  reward  r2  in  the  worlds  with 
outcome  2,  taking  it  to  the  final  state 

\G)=u\l)\M1)\r1)+p\2)\M2)\r2),  (31) 


which  can  also  be  represented  as  the  game  matrix 


G  = 


(32) 


4To  the  best  of  our  knowledge  there  has  been  no  attempt  to  calculate  the  number  of  worlds  in  an  even 

remotely  realistic  model  of  a  measurement. 

5Note  that  Stern  and  Gerlach  were  not  aware  that  they  were  measuring  spin,  rather  they  interpreted  the 
experiment  as  a  proof  of  Bohr-Sommerfelds  spatial  quantization  hypothesis. 
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[>^  r2 
rz 

Figure  1:  One  can  measure  a  state  in  the  {|1),  |2) }  basis  by  applying  a  unitary  U  to 
1 2)  that  takes  it  to  a  superposition  of  \2)  and  |3),  measuring  the  state  in  the 
{ \\),  |2),  |3) }  basis,  and  label  as  2  both  results  2  and  3. 

An  equivalent  way  to  play  this  game,  shown  in  Fig.  l,  is  to  make  a  measurement  on 
the  state 

W)  =  «|1>  +  pu\2)  =  oc\l)  +  /37|2>  +  fiS\3)  (33) 

instead,  where  an  unitary  U  was  applied  to  \2),  and  apply  the  label  2  and  give  reward 
r2  to  both  outcomes  2  and  3,  leading  to  the  final  state 

|G')  =«|l)|M1)|r1)+/37|2)|M2)|r2)+/lJ|3)|M^)|r2),  (34) 

where  \M'2 )  is  a  measurement  result  physically  distinct  from  |M2),  but  with  the  same 
label. 

This  second  way  to  do  the  measurement  still  gives  outcomes  1  and  2  whenever  the 
initial  state  is  |1)  and  |2),  so  it  respects  the  definition  of  measurement  given  above  and 
normal  practice  in  real  laboratories. 

If  one  does  not,  however,  coarse-grain  the  outcomes  2  and  3  together,  but  rather 
leaves  them  distinct,  then  we  are  playing  instead  the  three-outcome  game 


a 

rA 

h 

D  • 

(35) 

r2/ 

We  postulate  that  this  labelling  choice  does  not  make  a  difference,  so  a  rational  agent 
must  assign  the  same  value  to  games  G  and  G' .  This  suffices  to  fine-grain  arbitrary 
games  into  symmetric  games.  As  an  illustration,  consider  the  game 

G -(*•£)■  (36) 

Doing  the  fine-graining  with  a  unitary  U  that  takes  the  state  |2)  to 

P-i6 

u|2)  =  7f(|2)  +  |3))'  (37) 
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we  take  G  to 


( 

1 

\ 

G’  = 

V2 

^2 

• 

(38) 

\ 

V2 

n, 

/ 

2)  and  3),  we 

take 

G'  to 

/ 1 

O 

1 

ri 

G"  = 

1 

r2 

/ 

(39) 

1 

ri 

l1 

ri) 

and  so  V(G 

") 

=  V(G)  can  be  evaluated  via 

Lemma  2, 

V(G)  = 

1 

5fl 

4 

+  5 

L>- 

(40) 

resulting  in 


We  can  now  prove  the  Born  rule  in  the  general  case  by  formalising  this  argument  via 
the  following  axiom: 


•  z-Fine-graining  Let 


G  = 


(oci  rt\ 
oi2  r2 


(41) 


\0tn  Tn) 

be  a  game  with  n  outcomes.  Then  for  any  complex  numbers  an,  #12  such  that 
y^ttnl2  +  \&i2\2  =  |ai|  the  fine-grained  game  with  n  +  l  outcomes 


/«ii 

g\ 

a  12 

r\ 

G'  = 

a2 

ri 

V  a?; 

rj 

(42) 


has  value  V(G')  =  V(G) 

Theorem  4  (Deutsch-Wallace).  Let  G  =  (a,  r)  be  a  game.  A  rational  agent  in  Many-Worlds 
must  assign  it  value 


V(G)  = 


2  i= 1 


(43) 


Proof.  Let  then  G  =  (a,  r)  be  a  n-outcome  game  such  that  the  coefficients  a,  are  of  the 
form  a.;  =  .  / '  ar>d  do  a  trivial  2-Fine-graining  in  each  outcome  to  take  it  to  a 


(/-outcome  game  G'  =  (cl  ,  r)  with  coefficients  a-  =  a,  =  J  g. 
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Let  then  d  =  Y\l=\  blr  and  define  the  integer  a\  =  dcij/b,  so  that  ft,-  =  \/j-  One 
can  then  fine-grain  each  coefficient  ft,-  into  a  ■  coefficients  equal  to  1  /  \fd,  obtaining  a 

symmetric  game  G"  =  (ldXi  “[)  /  with  reward  vector  r'  =  ®”=1  i^a,\  By  Lemma 

2  the  value  of  G"  is  given  by 


V{G")  = 


E  = 


n  aj 

LLn  = 

-i  i=l;'=l 


i= 1  ui  i= 1  ;= 1 


_ 1 _ r1 

2-‘i= 1  b;  i= i 


,  r;  = 


1 2  i=l 


(44) 


and  by  2-Fine-graining  this  is  equal  to  V (G). 

To  generalise  this  argument  for  a  game  G  =  (ft,  r)  with  arbitrary  complex  coefficients 
ftp  it  is  enough  to  notice  that  numbers  of  the  form  \J\elQ  are  dense  in  the  complex 
plane.  Let  then  ft/f  be  a  sequence  of  coefficient  vectors  such  that  Hm/C  _>OQ  a*.  =  a  and 
that  for  all  k  we  have  «&  =  .  / .  Let  then  G;c  =  (a^,  r).  By  Continuity,  we  have  that 


V(G)  =  lim  V(Gk)  =  lim 

k— >00  k— >00 


M-k 


12  /=! 


«fc/|  L  = 


pEl 

2  f=i 


(45) 


□ 


Note  that  we  did  not  have  to  assume  that  the  initial  state  was  normalised,  as  the 
proof  implies  that  a  factor  of  ||ft must  appear  on  the  denominator  of  the  value  func¬ 
tion.  We  did  have  to  assume  that  the  transformations  between  quantum  states  are 
done  via  unitaries,  but  this  comes  from  the  background  assumption  of  Many-Worlds 
quantum  mechanics. 

In  some  textbooks  one  postulates  the  Born  rule  as  a  fundamental  principle,  and 
motivates  unitary  evolution  from  conservation  of  probabilities,  see  e.g.  Refs.  [  -37]. 
Such  an  approach  would  make  deriving  the  Born  rule  from  the  unitary  evolution  a  bit 
circular.  Other  textbooks,  however,  postulate  unitarity  as  fundamental,  motivated  by 
analogy  with  Hamiltonian  classical  mechanics,  and  consider  the  Born  rule  a  separate 
postulate,  see  e.g.  Refs.  [38-40].  This  latter  approach  parallels  the  historical  develop¬ 
ment  of  the  Schrodinger  equation  and  the  Born  rule  [41,42]. 


3.1.  Generalisation 

To  emphasize  that  the  Born  rule  comes  from  fine-graining  through  unitary  transform¬ 
ations  that  preserve  the  2-norm,  we'd  like  to  generalise  the  fine-graining  argument  to 
transformations  that  preserve  some  other  norm.  Which  norms  should  we  consider?  As 
we  show  in  Appendix  A,  some  weak  consistency  conditions  imply  that  fine-graining 
can  only  be  done  if  the  transformations  preserve  the  p-norm  of  the  vectors. 


M 


Consider,  then,  some  hypothetical6  many- worlds  theory  where  the  p-norm  is  pre¬ 
served.  In  such  a  theory  we  can  fine-grain  the  game 


G  = 


1  n 

2eld  r2 


(46) 


by  using  a  transformation  T  that  takes  the  state  \2)  into7 

-W  2  p 



(47) 


i= 1 


The  fine-grained  game  is  then 


G'  = 


1  n 

l(2P)  r2 


(48) 


where  1(2^  is  the  vector  of  21’  ones,  and  from  the  analogous  argument  we  conclude 
that 

1  2p  —  1 
V(G)  =  -n  +  ^r2. 


2P  2^ 

The  general  p-Born  rule  can  then  be  proven  via  the  following  axiom: 


(49) 


p-Fine-graining  Let 


G  = 


(oi\  n\ 
oi2  r2 


(50) 


\0in  Y  n  J 

be  a  game  with  n  outcomes.  Then  for  any  complex  numbers  ol\\,ol\2  such  that 
(kii|p  +  Wi2\p)  *  =  \oc\\  the  fine-grained  game  with  n  4-1  outcomes 


/«n 

n\ 

a  12 

n 

G'  = 

a2 

C> 

V  a*; 

I'll/ 

(5i) 


has  value  V(G')  =  V(G) 


6One  should  take  seriously  the  "hypothetical"  here,  as  theories  where  the  p-norm  is  preserved  are  rather 
pathological.  As  shown  in  Ref.  [43],  the  only  linear  transformations  that  preserve  the  p-norm  of  all 
vectors  for  p  7^  1, 2  are  permutations  composed  with  phases.  Here  we  get  around  this  by  only  asking 
the  transformation  T  to  preserve  the  norm  of  the  computational  basis. 

7In  this  example  p  must  be  of  the  form  log.,  n  for  integer  n  >  2,  but  in  general  any  real  p  >  1  works. 
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Theorem  5.  Let  G  =  (a,r)  be  a  game.  A  rational  agent  in  a  many-worlds  theory  where 
transformations  preserve  the  p-norm  must  assign  it  value 


V(G)  = 


a 


7>E 

P  i= 1 


ii\pn 


(52) 


Proof.  The  proof  is  similar  to  the  Many- Worlds  case,  so  we  shall  only  sketch  it:  consider 
a  game  G  =  («,r)  with  coefficients  Oij  =  ''  el9i ,  do  a  trivial  p-Fine-graining  to  get 


rid  of  the  phase,  rewrite  the  coefficients  as  for  d  =  111=1  and  a\  =  da,/bu 

1 

fine-grain  it  into  a  game  with  a\  coefficients  equal  to  (1  /d)v  for  each  outcome  i,  use 
Lemma  2  to  conclude  that  the  value  of  the  fine-grained  game  is  given  by  equation  (52), 
and  use  Continuity  to  show  that  this  formula  is  valid  for  any  game.  □ 


Notice  that  not  only  the  proof  doesn't  work  for  the  case  of  the  max-norm  (often 
referred  to  as  p  =  00),  but  the  result  is  also  false,  as  it  is  not  possible  to  fine-grain 
all  games  into  symmetric  games  via  transformations  that  preserve  the  max-norm.  For 
example,  any  fine-graining  of  the  game 


G  = 


(53) 


will  have  at  least  one  coefficient  1  associated  to  reward  r\  and  at  least  one  coefficient  2 
associated  to  reward  r2-  One  could  try,  nevertheless,  to  arbitrarily  define 


V(G)  =  lim 

p—^co 


Ip  i=l 


=  —  E 

#M  L-‘ 


ieM 


(54) 


where  M  is  the  set  of  i  such  that  a;  =  max,  a,  | .  This  max-Born  rule  would  be 
consistent  with  Constancy,  Dominance,  Additivity,  and  Indifference,  but  not  with 
Continuity. 


4.  Objective  probabilities 

We  do  not  accept  that  the  behaviour  of  a  rational  decision  maker  should 
play  a  role  in  modelling  physical  systems  -  Richard  Gill  [ij]. 

Up  to  now  the  discussion  has  been  exclusively  about  subjective  probabilities.  We 
have  argued  only  that  it  would  be  irrational  to  assign  probabilities  different  than  those 
given  by  the  generalised  Deutsch-Wallace  theorem  because  one  would  unjustifiably 
break  the  symmetries  of  the  theory.  We  haven't  argued,  though,  that  it  would  be  ir¬ 
rational  to  assign  different  probabilities  because  they  would  predict  different  relative 
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frequencies.  Such  an  argument  would  be  suspect:  why  should  these  explicitly  subject¬ 
ive  probabilities  be  connected  to  the  clearly  objective  relative  frequencies?  Shouldn't 
relative  frequencies  be  connected  to  objective  probabilities  instead? 

The  situation  is  not  as  bad  as  it  sounds,  because  although  subjective,  these  prob¬ 
abilities  depend  only  on  the  coefficients  of  the  game,  which  are  objective,  and  their 
derivation  is  done  from  defensible  rationality  axioms  together  with  facts  about  the 
world  and  the  theory  describing  it;  it  would  be  rather  surprising  if  these  subjective 
probabilities  turned  out  to  be  completely  fantastic.  In  fact,  such  well-grounded  sub¬ 
jective  probabilities  are  even  taken  to  be  equal  to  the  objective  probabilities  by  Lewis' 
Principal  Principle  [  ],  which  states  that 

Pr0(£)  =  Prs(£|HT),  (55) 

that  is,  the  objective  probability  of  an  event  E  is  equal  to  the  subjective  probabilities 
assigned  to  £  by  a  rational  agent  that  knows  the  history  of  the  world  up  to  this  point 
H  and  the  theory  that  describes  the  world  T. 

This  is  still  a  bit  unsatisfactory,  because  the  relationship  between  objective  probab¬ 
ilities  and  relative  frequencies  shouldn't  depend  on  the  opinions  of  rational  agents  or 
even  their  existence.  We  believe,  after  all,  that  radioactive  elements  have  been  decaying 
on  Earth  much  before  humans  appeared  to  reason  about  them,  and  it  would  be  absurd 
to  assume  that  the  relative  frequencies  budged  at  all  when  we  arrived. 

We  can  do  better,  though.  We  shall  propose  a  definition  of  objective  probability  in 
terms  of  the  proportion  of  worlds  in  which  an  event  is  realised,  and  show  that  the 
connection  of  these  objective  probabilities  with  relative  frequencies  is  mathematically 
identical  to  the  one  in  single-world  theories.  In  this  way  they  fulfil  the  main  role8 
objective  probabilities  must  play,  as  defended  by  Saunders  in  Ref.  [45]. 

This  connection,  in  single-world  theories,  is  given  by  the  law  of  large  numbers,  that 
says  roughly  that  for  a  large  number  of  trials  the  relative  frequencies  will  be  close 
to  the  objective  probability,  with  high  objective  probability9.  More  precisely,  if  the 
objective  probability  of  observing  some  event  is  p,  then  after  N  trials  the  objective 
probability  that  the  frequency  f\,  will  be  farther  than  £  away  from  p  is  upperbounded 
by  a  function  that  is  decreasing  in  N  and  £,  that  is,  that 

Pr0(|/N  -  V\  >  e)  <  2e~2N£l  (56) 

where  the  precise  form  of  the  upper  bound  is  not  relevant,  but  for  concreteness  we 
used  the  one  given  by  the  Hoeffding  inequality. 

8We  talk  about  the  roles  objective  probabilities  play  instead  of  the  definition  of  objective  probabilities  be¬ 
cause  this  is  the  best  we  can  do.  Unlike  subjective  probabilities,  there  is  no  widely  accepted  definition 
of  objective  probability  that  we  could  try  to  satisfy. 

9One  often  hears  a  different  story,  that  in  the  limit  of  an  infinite  number  of  trials  the  relative  frequency 
is  equal  to  the  objective  probability,  full  stop.  This  is  simply  mistaken,  but  given  that  the  mathematics 
we  present  here  are  identical,  those  that  want  to  insist  on  this  mistake  can  do  it  equally  well  in 
many- world  theories. 
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To  see  how  this  works  in  many-world  theories,  we  shall  first  consider  how  the  relat¬ 
ive  frequencies  behave  in  Kent's  universe,  and  then  generalise.  Consider  then  that  one 
is  collecting  relative  frequencies  by  performing  N  repetitions  of  a  n-outcome  meas¬ 
urement,  where  m,-  worlds  are  created  with  outcome  i.  After  N  trials  there  are  nN 
sets  of  worlds,  each  set  identified  by  a  sequence  of  outcomes  s  =  (si, . . . ,  sfa,  with 
Si  £  {1 , ,n}.  The  number  of  worlds  where  a  sequence  s  is  obtained  is  clearly  given 
by 


N 


#, = ip  =  n 


m 


ki 


(57) 


i=l  i= 1 

where  fc,  is  the  number  of  outcomes  i  in  s.  If  we  define  then  the  proportion  of  worlds 
with  outcome  sequence  s  as 

tts  :=  (58) 

Es'  #s' 

then  the  proportion  of  worlds  where  the  relative  frequency  fa  of  outcome  1  is  k/N  is 


#(fc,JV)  =  (  ~  |ttl(i  -  tto 


\N-k 


(59) 


which  is  formally  identical  to  the  binomial  distribution,  and  therefore  allows  us  to 
prove  a  law  of  large  numbers  for  the  proportion  of  worlds,  which  says  that  in  most 
worlds  the  relative  frequency  of  outcome  1  will  be  close  to  jji,  or  more  precisely  that 

>e)  <2e~2N^.  (60) 


This  suggests  that  we  could  try  identifying  objective  probabilities  with  the  propor¬ 
tion  of  worlds  in  a  general  many-worlds  theory  Consider  then  that  in  such  a  theory 
one  is  collecting  frequencies  by  performing  N  repetitions  of  a  n-outcome  measurement 
as  in  Kent's  universe.  Now  the  branching  coefficients  are  not  the  number  of  worlds 
created  with  each  outcome,  however,  and  as  argued  in  section  3  we  do  not  think  that 
it  is  possible  to  assign  a  sensible  world  count.  We  can,  however,  assign  a  sensible 
world  measure10:  a  function  A  that  assigns  a  real  number  As  >  0  to  the  set  of  worlds 
with  outcome  sequence  s,  and  defines  the  measure  of  a  set  of  worlds  with  outcome 
sequences  in  some  set  S  to  be 

A(S)  =  £a8.  (61) 

s£  S 

Since  the  branchings  are  completely  independent,  it  is  natural  to  postulate  that 

N 

As =nAv  (62) 

i= 1 


“Which  does  include  the  counting  measure  as  a  particular  case,  so  we  are  by  no  means  assuming  that 
worlds  cannot  be  counted. 
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that  is,  that  the  measure  of  the  set  of  worlds  with  outcomes  s  is  the  product  of  the 
single-trial  measures,  and  this  is  enough  to  show  that  in  most  worlds  the  relative 
frequencies  will  be  right.  Defining  the  proportion  of  worlds  with  outcome  sequence  s 
to  be 


As  := 


As 


ES'  A/ 

it  follows  that  the  proportion  of  worlds  with  relative  frequency  k/N  of  outcome  1  is 


(63) 


X(k,N)=  Q^Aftl-Ai)*-*, 


(64) 


which  is  again  formally  identical  to  the  binomial  distribution,  and  thus  the  law  of 
large  numbers  for  the  proportion  of  worlds  follows. 

This  implies  that  in  any  many-worlds  theory  with  such  a  product  measure  it  is 
eminently  reasonable  to  count  relative  frequencies,  and  use  them  to  guess  which  is  the 
proportion  of  worlds  with  a  given  outcome:  in  most  worlds  one  will  be  right. 

This  works  perfectly  well  in  Many-Worlds,  if  we  take  the  measure  of  worlds  to 
be  the  one  suggested  by  the  Born  rule,  or,  more  generally,  the  p-Born  rule.  To  see 
that,  consider  again  the  N  repetitions  of  the  H-outcome  measurement,  which  in  Many- 
Worlds  is  represented  by  the  transformation 


N  n 

>-> 


i= 1  j= 1 


N  n 


w)  =  ®Ea;l/)lM))' 

i= 1  ;=1 


(65) 


where  the  final  state  |w)  can  be  written  as  the  superposition  of  the  nN  sets  of  worlds 


N  N 

w)  =  J2  K>  =  EEK  ®  Is;)  I (66) 

s  s  i= 1  j= 1 


If  we  then  postulate  the  measure  of  the  set  of  worlds  |;cs)  to  be 

As  :=  ||  |  zvs)  ||p. 


(67) 


then  it  does  decompose  as  a  product  of  the  single-trial  measures,  as  required.  Defining 
again  the  proportion  of  worlds  with  outcomes  s  to  be 


As  := 


As 

Es'  As'' 


(68) 


we  see  that  the  proportion  of  worlds  with  relative  frequency  fa  =  k/N  is  again  given 
by  the  binomial  distribution,  from  which  the  law  of  large  numbers  for  the  proportion 
of  worlds  again  follows. 

In  summary,  if  the  correct  way  to  measure  the  proportion  of  worlds  is  via  the  p- 
norm,  then  in  most  worlds  we  will  observe  frequencies  conforming  to  the  p-Born 
rule.  Conversely,  if  we  observe  relative  frequencies  conforming  to  the  2-Born  rule,  we 
should  bet  that  the  correct  way  to  measure  the  proportion  of  worlds  is  via  the  2-norm. 
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5.  Multiplying  or  splitting 


Figure  2:  An  agent  performs  a  measurement  described  by  the  coefficients  ci  and  Ci. 

In  the  worlds  where  outcome  2  is  obtained,  the  agent  performs  the  same 
measurement  again. 

The  analogy  between  Kent's  universe  and  Many- Worlds  is  not  perfect,  however,  and 
it  breaks  down  in  the  Once-or-Twice  scenario  shown  in  Fig.  2  (discussed  by  Wallace  in 
Ref.  [23]  and  Sebens  and  Carroll  in  Ref.  [46]).  In  it,  an  agent  performs  a  measurement 
described  by  the  coefficients  C\  and  ci,  and  in  the  worlds  where  outcome  2  is  obtained 
they  perform  the  same  measurement  again.  What  are  the  subjective  and  objective 
probabilities  of  the  sequences  of  outcomes  (1),  (2,1),  and  (2,2)? 

First  we  shall  consider  the  objective  probabilities.  In  Kent's  universe  it  seems  clear 
what  they  are:  there  are  in  total  C\  +  C1C2  +  c\  worlds,  of  which  C\  have  outcome  (1), 
C2C1  have  outcomes  (2, 1),  and  c\  have  outcomes  (2,2),  so  the  proportions  are 


«  _  0 

Bl  ,  |  2' 

C\  +  C1C2  +  c2 

(69a) 

U  _  C2Cl 

B21  1  2' 

C\  +  C\C2  +  C2 

(69b) 

C2 

$22  =  ~  2  ~  2- 
C\  +  C\C2  +  C2 

(69c) 

If  we  measure  frequencies,  which  are  after  all  what  objective  probabilities  should  be 
connected  to,  then  in  most  worlds  they  will  be  close  to  these  proportions. 

In  Many-Worlds  (and  more  generally  in  the  hypothetical  p-theories),  however,  Once- 
or-Twice  is  described  by  the  transformation 

(c1|1)+c2|2))|M?)(c1|1)+c2|2))|M?) 

c1|l)|Mi)(ci|l)+c2|2))|M?)+C2Ci|2)|M2)|l)|Mi)+ci|2)|M2)|2)|M2),  (70) 
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and  the  measure  of  worlds  is  given  by  equation  (6 7),  which  tells  us  that 


Ai 
A21  : 
A22  : 



'  \ct\P  + 

\C2n 
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hi' 

\v' 
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(M 

-  NS 

+ 

|c2 

lV 

|P)2 

(Np  + 

\C2 

|P)2 

(71a) 

(7ib) 

(71c) 


a  fundamentally  different  result.  If  c\  =  Ci,  for  example,  jji  =  1/3,  but  X\  =  1/2. 

What  is  the  source  of  this  disconnect?  Our  proposal  is  that  while  in  Kent's  universe 
the  number  of  worlds  is  quite  literally  being  multiplied  upon  branching,  this  is  clearly 
not  the  case  in  Many-Worlds,  as  branching  increases  neither  number  of  particles  nor 
energy.  In  fact,  the  total  measure  of  worlds,  as  given  by  equation  (67),  is  conserved.  A 
closer  analogue  to  Many-Worlds  would  be  a  reversed  version  of  Kent's  universe  where 
branching  conserves  the  number  of  worlds.  There  the  computer  simulation  starts 
with  a  large  number  of  worlds,  and  when  a  measurement  is  made  each  outcome  is 
imprinted  in  a  subset  of  the  worlds,  with  the  relative  sizes  of  these  subsets  determined 
by  the  coefficients.  In  the  Once-or-Twice  case,  it  suffices  to  start  with  (ci  +  C2)2  worlds. 
After  the  first  branching  (ci  +  C2)ci  are  imprinted  with  outcome  1,  and  (c\  +  02)02 
with  outcome  2.  Out  of  the  (c\  +  02)02  with  outcome  2  a  further  C2C1  receive  a  1,  and 
c\  receive  another  2.  The  proportions  of  worlds  are  given  by 


Cl 


$21 


Cl  +  C2 
C2C1 

(Cl  +c2)2' 


$22  — 


(Ci  +C2)2’ 


(72a) 

(72b) 

(72c) 


now  matching  Many-Worlds11. 

Now  turning  to  the  subjective  probabilities,  we  face  a  difficulty,  because  the  gen¬ 
eralised  Deutsch-Wallace  theorem  proved  here  deals  only  with  simple  measurements, 
not  sequences  of  measurements.  The  original  versions  by  Deutsch  and  Wallace  do 
attribute  probabilities  to  sequences  of  measurements,  though,  through  an  axiom  that 
Deutsch  called  substitutibility  and  Wallace  called  diachronic  consistency.  It  says  that 
the  value  of  a  game  does  not  change  if  one  of  its  rewards  is  replaced  by  a  game  of 
equal  value,  so  if  an  agent  attributes  value  V(G)  to  some  game  G,  they  really  must 
be  indifferent  between  receiving  reward  V(G)  or  playing  G,  even  when  this  reward 
would  be  given  as  part  of  another  game.  More  formally,  it  is 

"It  should  be  clear  that  neither  the  regular  nor  the  reverse  Kent's  universe  are  realistic  analogues  of 
Many-Worlds:  both  of  them  require  an  exponential  number  of  worlds,  either  in  the  end  or  in  the 
beginning  of  the  simulation. 
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•  Substitution  Let  G  =  (c,r)  be  a  game,  and  G'  =  (<:',  r')  another  game  such  that 
V(G')  =  r\.  Then  the  sequential  game  where  an  agent  plays  G  but  instead  of  receiving 
reward  r\  they  play  G'  has  value  equal  to  V(G). 

With  it,  we  can  prove  that  the  subjective  probabilities  in  Many-Worlds  match  the  pro¬ 
portion  of  worlds  from  equations  (71),  as  expected,  but  Substitution  also  implies  that 
in  both  the  normal  and  the  reverse  Kent's  universe  the  subjective  probabilities  match 
those  from  equations  (72).  Here  the  Principal  Principle  raises  a  red  flag,  as  in  the 
regular  Kent's  universe  they  should  match  equations  (69)  instead. 

What  went  wrong?  Well,  Substitution  implies  that,  in  the  case  where  c\  =  C2  =  1, 
one  should  be  indifferent  between  playing  a  game  where  one  future  self  gets  reward  r\ 
and  another  reward  ri,  and  another  game  where  one  future  self  gets  reward  r\  but  two 
get  reward  ri.  If  all  these  future  selves  are  equal,  as  is  the  case  in  Kent's  universe,  one 
should  definitely  not  be  indifferent!  One  should  prefer  the  second  game,  and  assign 
probability  1/3  to  each  outcome12.  This  is  also  the  answer  one  obtains  from  Elga's 
indifference  principle  [  \y]. 

We  should,  therefore,  reject  Substitution  as  a  rationality  principle  valid  for  gen¬ 
eral  many-world  theories,  as  it  seems  valid  only  for  those  where  the  total  measure  of 
worlds  is  conserved  upon  branching.  As  a  substitute,  in  both  the  regular  and  the  re¬ 
verse  Kent's  universe  we  can  unproblematically  follow  the  objective  probabilities,  and 
say  that  a  sequential  game  is  equivalent  to  a  simple  game  where  the  same  rewards  are 
given  in  the  same  number  of  worlds.  In  Many-Worlds  the  objective  probabilities  are 
not  well-established,  so  instead  we  argue  that  different  ways  of  doing  a  measurement 

I2Sebens  and  Carroll  claim,  however,  that  an  agent  that  assigns  these  probabilities  can  be  Dutch-booked 
[46]:  After  the  first  measurement,  but  before  the  second,  an  agent  that  is  ignorant  of  the  outcome 
will  assign  probability  1  /2  to  being  in  each  world,  and  will  therefore  accept  a  bet  that  pays  3€  in  the 
world  with  outcome  1  and  —  3€  in  the  world  with  outcome  2.  After  the  second  measurement,  the 
agent  will  assign  probability  1/3  to  being  in  each  world,  and  will  therefore  accept  a  bet  that  pays  — 4€ 
in  the  world  with  a  single  outcome  1  and  2€  in  the  worlds  with  outcomes  (2, 1)  and  (2, 2).  If  the  agent 
accepts  both  bets,  then  in  all  worlds  they  will  lose  1€. 

The  problem  with  this  argument  is  that  the  agent  would  not  have  accepted  the  first  bet  if  they  knew 
that  they  would  be  multiplied  in  the  world  with  outcome  2:  the  assigment  of  probability  1/2  was  a 
mistake,  that  the  agent  corrected  after  learning  of  the  second  multiplication.  Given  that  they  accepted 
the  incorrect  bet,  however,  the  agent  knows  that  accepting  the  second,  fair,  bet  will  cause  them  to 
be  Dutch-booked,  so  they  would  reject  it.  Note,  furthermore,  that  making  two  bets  about  the  same 
situation  using  two  different  probabilities  generically  leads  to  Dutch-booking,  even  in  single-world 
theories.  Consider  an  agent  that  believes  a  coin  to  be  fair,  and  therefore  accepts  a  bet  that  pays  3€ 
if  heads,  and  — 3€  if  tails.  If  the  agent  changes  their  mind,  and  decides  instead  that  the  coin  has 
probability  2/3  of  coming  up  tails,  and  accepts  a  bet  that  pays  — 4€  if  heads  and  2€  if  tails,  then  the 
agent  will  lose  1€  independently  of  the  result  of  the  coin  flip. 

This  raises  the  question  of  how  a  realistic  agent  in  Kent's  universe  could  ever  assign  probabilities, 
since  they  would  depend  on  the  whole  tree  of  future  branchings.  The  agent  could  simply  recognize 
that  it  is  not  possible  to  determine  the  objective  probabilities,  and  use  only  the  part  of  the  branching 
tree  that  they  can  foresee  to  calculate  their  subjective  probabilities.  Analogously,  a  grandparent  could 
decide  to  divide  their  inheritance  among  their  children  weighted  by  how  the  number  of  grandchildren 
they  begat,  but  refuse  to  speculate  about  how  many  children  each  grandchild  will  have. 
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are  equivalent,  as  done  in  section  3.  Specifically,  we  say  that  measuring  a  state  \ip)  and 
then  a  state  |  cp)  in  the  worlds  with  outcome  2  is  equivalent  to  doing  a  joint  measure¬ 
ment  on  \ip)\(p)  and  coarse-graining  the  outcomes  of  the  measurement  on  (p)  in  the 
worlds  where  the  outcome  of  the  measurement  on  ip)  was  different  than  2.  This  al¬ 
lows  us  to  derive  Substitution  as  a  theorem  in  Many-Worlds  and  in  the  reverse  Kent's 
universe,  but  not  in  the  regular  Kent's  universe.  This  is  done  in  detail  in  Appendix  B. 

In  all  many-world  theories,  these  arguments  imply  that  the  agents  should  be  indif¬ 
ferent  to  the  number  of  branchings,  caring  only  about  which  coefficients  are  associated 
to  which  rewards,  thus  satisfying  the  Principal  Principle. 


6.  Conclusion 

We  investigated  how  subjective  and  objective  probabilities  work  in  many-world  theor¬ 
ies. 

With  respect  to  subjective  probabilities,  we  generalised  the  Deutsch-Wallace  the¬ 
orem,  and  showed  how  it  can  be  broken  down  in  three  parts:  the  first  is  a  decision- 
theoretical  core  common  to  both  single-world  and  many-world  theories,  that  says  that 
rational  agents  should  use  probabilities  to  make  their  decisions.  The  second  part  is  a 
symmetry  argument  very  natural  for  many-world  theories,  but  less  so  for  single-world 
ones,  that  implies  that  rational  agents  should  assign  uniform  probabilities  to  physic¬ 
ally  symmetric  games.  The  last  part  is  a  fine-graining  argument  used  to  reduce  arbit¬ 
rary  games  to  symmetric  ones.  This  reduction  takes  different  forms  depending  on  the 
precise  physics  of  the  many-worlds  theory  in  question:  in  Kent's  universe  it  implies 
that  the  probabilities  come  from  the  1-norm,  while  in  Many-Worlds  it  implies  that 
the  probabilities  come  from  the  2-norm.  While  in  Kent's  universe  the  fine-graining 
is  motivated  explicitly  from  many-worlds  considerations,  the  Many-Worlds  version 
depends  only  on  operational  assumptions,  so  if  one  can  stomach  the  symmetry  argu¬ 
ment,  this  derivation  of  the  Born  rule  can  also  be  considered  valid  for  single-world 
versions  of  quantum  mechanics. 

With  respect  to  objective  probabilities,  we  have  argued  that  they  should  be  identified 
with  the  proportion  of  worlds  in  which  an  event  happens,  and  showed  that  for  any 
measure  of  worlds  that  has  a  product  form  an  analogue  of  the  law  of  large  numbers 
can  be  proven:  in  most  worlds  the  relative  frequency  will  be  close  to  the  proportion  of 
worlds.  In  Kent's  universe,  the  motivating  example,  this  argument  can  tell  us  what  the 
objective  probabilities  are,  as  it  is  obvious  how  to  measure  the  proportion  of  worlds. 
Furthermore,  one  can  use  the  Principal  Principle  as  a  consistent  check,  and  note  that 
they  match  the  subjective  probabilities  from  the  generalised  Deutsch-Wallace  theorem. 

In  Many-Worlds,  however,  it  is  not  clear  how  to  measure  the  proportion  of  worlds, 
so  this  argument  cannot  be  used  to  derive  the  objective  probabilities.  It  could  be  used 
in  the  other  direction,  though:  if  one  takes  both  the  Deutsch-Wallace  theorem  and  the 
Principal  Principle  as  true,  then  one  could  show  that  worlds  should  be  measured  via 
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the  2-norm.  Alternatively,  the  argument  can  simply  be  used  to  say  that  if  you  want 
to  find  out  what  the  proportion  of  worlds  is,  gathering  relative  frequencies  is  a  good 
idea,  as  in  most  worlds  you'll  get  the  right  answer. 

We  have  also  shown  that  the  analogy  between  Many-Worlds  and  Kent's  universe 
breaks  down  in  the  Once-or-Twice  scenario,  and  used  this  breakdown  to  argue  that  a 
key  rationality  principle  used  in  the  original  version  of  the  Deutsch-Wallace  theorem 
-  that  when  playing  a  sequential  game  agents  should  be  indifferent  between  receiving 
a  reward  or  playing  a  subgame  with  the  same  value  -  is  not  valid  in  general  many- 
world  theories,  and  thus  one  should  rely  on  other  arguments  to  calculate  the  value  of 
sequential  games. 


Acknowledgments 

We  thank  Koenraad  Audenaert,  Caslav  Brukner,  Eric  Cavalcanti,  Fabio  Costa,  Daniel 
SuE,  David  Gross,  Markus  Heinrich,  Philipp  Hohn,  Felipe  M.  Mora,  Jacques  Pien¬ 
aar,  Simon  Saunders,  and  David  Wallace  for  useful  discussions.  This  work  has  been 
supported  by  the  Excellence  Initiative  of  the  German  Federal  and  State  Governments 
(Grant  ZUK  81). 


References 

[1]  H.  Everett,  ""Relative  State"  Formulation  of  Quantum  Mechanics,"  Rev.  Mod. 
Phys.  29,  454-462  (1957). 

[2]  D.  Finkelstein,  "The  Fogic  of  Quantum  Physics,"  Transactions  of  the  New  York 
Academy  of  Sciences  25,  621-637  (1963). 

[3]  J.  B.  Hartle,  "Quantum  Mechanics  of  Individual  Systems,"  Am.  J.  Phys.  36,  704 
(1968). 

[4]  B.  S.  DeWitt,  "Quantum  mechanics  and  reality,"  Phys.  Today  23,  30  (1970). 

[5]  N.  Graham,  "The  measurement  of  relative  frequency,"  in  The  many-worlds 
interpretation  of  quantum  mechanics,  B.  DeWitt  and  N.  Graham,  eds.  Princeton 
University  Press,  1973. 

[6]  E.  Farhi,  J.  Goldstone,  and  S.  Gutmann,  "How  probability  arises  in  quantum 
mechanics,"  Ann.  Phys.  192,  368  -  382  (1989). 

[7]  E.  J.  Squires,  "On  an  alleged  "proof"  of  the  quantum  probability  law,"  Phys. 
Lett.  A  145,  67  -  68  (1990). 


24 


[8]  C.  M.  Caves  and  R.  Schack,  "Properties  of  the  frequency  operator  do  not  imply 
the  quantum  probability  postulate,"  Ann.  Phys.  315,  123-146  (2005), 

arXiv : quant-ph/04091 44. 

[9]  D.  Deutsch,  "Quantum  theory  of  probability  and  decisions,"  Proc.  R.  Soc.  Lond.  A 
455,  3129  (1999),  arXiv : quant-ph/990601 5. 

[10]  D.  Wallace,  "Everettian  Rationality:  defending  Deutsch's  approach  to  probability 
in  the  Everett  interpretation,"  Stud.  Hist.  Phil.  Sci.  B  34,  415  -  439  (2003), 
arXiv: quant-ph/0303050. 

[11]  D.  Wallace,  "Quantum  probability  from  subjective  likelihood:  Improving  on 
Deutsch's  proof  of  the  probability  rule,"  Stud.  Hist.  Phil.  Sci.  B  38,  311  -  332 
(2007),  arXiv  :quant-ph/031  21  57. 

[12]  D.  Wallace,  "A  formal  proof  of  the  Born  rule  from  decision-theoretic 
assumptions,"  in  Many  Worlds?  Everett,  Quantum  Theory  &  Reality,  S.  Saunders, 

J.  Barrett,  A.  Kent,  and  D.  Wallace,  eds.  Oxford  University  Press,  2010. 
arXiv:0906.2718  [quant-ph]. 

[13]  W.  H.  Zurek,  "Environment-Assisted  Invariance,  Entanglement,  and 
Probabilities  in  Quantum  Physics,"  Phys.  Rev.  Lett.  90, 120404  (2003), 
arXiv:quant-ph/021  1037. 

[14]  L.  Vaidman,  "Probability  in  the  Many-Worlds  Interpretation  of  Quantum 
Mechanics,"  in  The  Probable  and  the  Improbable:  Understanding  Probability  in 
Physics,  Essays  in  Memory  of  It amar  Pitowsky,  Y.  Ben-Menahem  and  M.  Hemmo, 
eds.  Springer,  2011.  PhilSci  :  8558. 

[15]  S.  M.  Carroll  and  C.  T.  Sebens,  "Many  Worlds,  the  Born  Rule,  and  Self-Locating 
Uncertainty,"  in  Quantum  Theory:  A  Two-Time  Success  Story,  D.  C.  Struppa  and 

J.  M.  Tollaksen,  eds.,  p.  157.  Springer  Verlag,  2014.  arXiv :  1 405 . 7907  [gr-qc]. 

[16]  A.  Kent,  "One  world  versus  many:  the  inadequacy  of  Everettian  accounts  of 
evolution,  probability,  and  scientific  confirmation,"  in  Many  Worlds?  Everett, 
Quantum  Theory  &  Reality,  S.  Saunders,  J.  Barrett,  A.  Kent,  and  D.  Wallace,  eds. 
Oxford  University  Press,  2010.  arXiv:0905.0624  [quant-ph], 

[17]  R.  D.  Gill,  "On  an  Argument  of  David  Deutsch,"  in  Quantum  Probability  and 
Infinite  Dimensional  Analysis:  from  Foundations  to  Applications,  M.  Schurmann  and 
U.  Franz,  eds.,  pp.  277-292.  World  Scientific,  2005.  arXiv :  quant-ph/03071  88. 

[18]  D.  J.  Baker,  "Measurement  outcomes  and  probability  in  Everettian  quantum 
mechanics,"  Stud.  Hist.  Phil.  Sci.  B  38,  153  ~  1^>9  (2007),  PhilSci  :  271  7. 


25 


[19]  D.  Albert,  "Probability  in  the  Everett  Picture,"  in  Many  Worlds?  Everett,  Quantum 
Theory  &  Reality,  S.  Saunders,  J.  Barrett,  A.  Kent,  and  D.  Wallace,  eds.  Oxford 
University  Press,  2010. 

[20]  R.  Dawid  and  K.  Thebault,  "Many  worlds:  decoherent  or  incoherent?,"  Synthese 
192,  1559-1580  (2015),  PhilSci  :  9542. 

[21]  A.  L.  G.  Mandolesi,  "Analysis  of  Wallace's  Proof  of  the  Born  Rule  in  Everettian 
Quantum  Mechanics:  Formal  Aspects,"  Found.  Rhys.  48,  751-782  (2018), 
arXiv: 1504.05259  [quant-ph]. 

[22]  A.  L.  G.  Mandolesi,  "Analysis  of  Wallace's  Proof  of  the  Born  Rule  in  Everettian 
Quantum  Mechanics  II:  Concepts  and  Axioms,"  Found.  Phys.  49,  24-52  (2019), 
arXiv: 1803.08762  [quant-ph]. 

[23]  D.  Wallace,  The  Emergent  Multiverse:  Quantum  Theory  According  to  the  Everett 
Interpretation .  Oxford  University  Press,  2012. 

[24]  J.  Joyce,  The  Foundations  of  Causal  Decision  Theory.  Cambridge  University  Press, 
2008. 

[25]  L.  Savage,  The  Foundations  of  Statistics.  Dover  Publications,  1954  (1972). 

[26]  B.  van  Fraassen,  "Indifference:  The  Symmetries  of  Probability,"  in  Laws  and 
Symmetry.  Oxford  University  Press,  1989. 

[27]  S.  Saunders,  "Decoherence,  relative  states,  and  evolutionary  adaptation,"  Found. 
Phys.  23,  1553-1585  (1993). 

[28]  D.  Wallace,  "Everett  and  Structure,"  Stud.  Hist.  Phil.  Sci.  B  34,  87-105  (2003), 
arXiv: quant-ph/01 071 44. 

[29]  W.  H.  Zurek,  "Decoherence,  einselection,  and  the  quantum  origins  of  the 
classical,"  Rev.  Mod.  Phys.  75,  715-775  (2003),  arXiv: quant-ph/01 051 27. 

[30]  J.  B.  Hartle,  "The  Quasiclassical  Realms  of  This  Quantum  Universe,"  Found. 

Phys.  41,  982-1006  (2011),  arXiv:  0806. 3776  [quant-ph], 

[31]  H.  Price,  "Decisions,  decisions,  decisions:  can  Savage  salvage  Everettian 
probability?,"  in  Many  Worlds?  Everett,  Quantum  Theory  &  Reality,  S.  Saunders, 

J.  Barrett,  A.  Kent,  and  D.  Wallace,  eds.  Oxford  University  Press,  2010. 

PhilSci: 3886. 

[32]  W.  Gerlach  and  O.  Stern,  "Der  experimentelle  Nachweis  der 
Richtungsquantelung  im  Magnetfeld,"  Zeitschrift  ftir  Physik  9,  349-352  (1922). 

[33]  T.  E.  Phipps  and  O.  Stern,  "Uber  die  Einstellung  der  Richtungsquantelung," 

Zeitschrift  fur  Physik  73,  185-191  (1932). 


26 


[34]  R-  Frisch  and  E.  Segre,  "liber  die  Einstellung  der  Richtungsquantelung.  II," 

Zeitschrift  fur  Physik  80,  610-616  (1933). 

[35]  J.-J.  Sakurai,  Modern  Quantum  Mechanics.  Addison  Wesley,  1993. 

[36]  R.  Feynman,  R.  Leighton,  and  M.  Sands,  The  Feynman  Lectures  on  Physics,  Volume 
3.  Addison- Wesley,  1977. 

[37]  L.  Landau  and  E.  Lifshitz,  Quantum  Mechanics:  Non-Relativistic  Theory. 

Pergamon,  1965. 

[38]  J.  von  Neumann,  Mathematische  Grundlagen  der  Quantenmechanik.  Springer,  1932. 

[39]  M.  Nielsen  and  I.  Chuang,  Quantum  Computation  and  Quantum  Information. 
Cambridge  University  Press,  2000. 

[40]  A.  Peres,  Quantum  Theory:  Concepts  and  Methods.  Springer  Netherlands,  2006. 

[41]  E.  Schrodinger,  "Quantisierung  als  Eigenwertproblem,"  Annalen  der  Physik  384, 
361-376  (1926). 

[42]  M.  Born,  "Zur  Quantenmechanik  der  Stofivorgange,"  Zeitschrift  fur  Physik  37, 
863-867  (1926). 

[43]  S.  Aaronson,  "Is  Quantum  Mechanics  An  Island  In  Theoryspace?," 
arXiv : quant-ph/0401 062. 

[44]  D.  Lewis,  "A  Subjectivist's  Guide  to  Objective  Chance,"  in  Studies  in  Inductive 
Logic  and  Probability,  Vol  II.,  R.  C.  Jeffrey,  ed.  University  of  California  Press,  1980. 

[45]  S.  Saunders,  "Chance  in  the  Everett  interpretation,"  in  Many  Worlds?  Everett, 
Quantum  Theory  &  Reality,  S.  Saunders,  J.  Barrett,  A.  Kent,  and  D.  Wallace,  eds. 
Oxford  University  Press,  2010.  arXiv:1609. 04720  [quant-ph]. 

[46]  C.  T.  Sebens  and  S.  M.  Carroll,  "Self-Locating  Uncertainty  and  the  Origin  of 
Probability  in  Everettian  Quantum  Mechanics,"  Br.  J.  Philos.  Sci.  69,  25-74  (2018), 
arXiv: 1405. 7577  [quant-ph]. 

[47]  A.  Elga,  "Defeating  Dr.  Evil  with  Self-Locating  Belief,"  Philosophy  arid 
Phenomenological  Research  69,  383-396  (2004). 

[48]  F.  Bohnenblust,  "An  axiomatic  characterization  of  Lp-spaces,"  Duke  Math.  J.  6, 
627-640  (1940). 


27 


A.  Consistent  fine-graining 


For  which  norms  can  the  fine-graining  argument  work? 

There  are  multiple  ways  to  fine-grain  a  game  with  unequal  coefficients  into  a  sym¬ 
metric  game.  For  example,  if  }i  =  ||  (||lJ")||,  ||l(A||)  ||,  where  is  the  vector  with  n 
ones,  one  can  fine-grain  the  game 


G  = 


(73) 


either  by  first  taking  it  to 


(74) 


and  then  applying  two  more  fine-grainings  to  take  it  to 


G" 


(  1  n  \ 

l(”)  r{2n)  , 

\i («)  4m)y 


(75) 


or  by  taking  G  directly  to  G" ,  which  will  be  possible  only  if  p  =  ||l(n+m)||.  We  want 
all  possible  ways  of  fine-graining  a  game  to  give  same  result,  so  we  demand  the  norm 
to  be  such  that  for  all  vectors  v  and  w  with  disjoint  support 


v  +  w|!  =  ||  (II v||,  ||w||) 


(76) 


We  also  demand  the  norm  to  be  permutation-invariant,  as  it  seems  unphysical  to  at¬ 
tribute  meaning  to  the  labelling  of  the  vectors,  and  that  ||  (1, 1)  ||  7^  1,  because  otherwise 
||  14")  ||=1  for  all  n,  and  it  is  therefore  impossible  to  fine-grain  any  non-trivial  game. 

These  conditions  are  enough  to  show  that  these  norms  must  be  equivalent  to  p- 
norms  when  restricted  to  vectors  of  rational  numbers,  as  can  be  seen  by  adapting  an 
argument  by  Bohnenblust  [48].  We  have  then 

Theorem  6.  Let  ||  ■  ||  :  CN  — >  1R  be  a  permutation-invariant  norm  for  N  >  3  such  that 
||(1,1)||  ^  1  and 

llv  +  wll  =  IIOMUMDII  (77) 

for  all  vectors  v,  w  with  disjoint  support.  Then  for  any  vector  c  such  that  the  absolute  value  of 
its  components  is  rational, 

llcll  =  (ENp)P'  (78) 

i 

for  some  real  number  p  >  1. 
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Proof.  Let  /  be  such  that  /( 1)  :=  ||1||  and  f(n  +  1)  :=  ||(1,/(h))||.  First  note  that 
/(l)  =  1,  as  ||1||  =  ||(1,0)||  =  ||(||(1,0)||,||(0,0)||)|| 

We  need  to  show  that  fin)  is  monotonous,  and  that  f(nk )  =  fk{n).  For  the  former, 
consider  the  identity  2(/(n),0)  =  (/(h),  1)  +  (/(h), —1)  and  take  the  norm  of  both 
sides.  By  the  triangle  inequality 

2/(n)  <  ||  (/(«),  1)||  +  ||  (/(«),-!) ||  =2/(n  +  l).  (79) 

For  the  latter,  first  we  show  that  /(n  +  m)  =  ||  (f(m),  f(n)\\.  Assume  that  it  holds  for 
some  m.  Then 


/(n  + 1  +  m)  =  ||(/(m),/(n  +  l))||  =  ||(/(m),l,/(n))||  =  ||(/(m  +  l),/(n))||.  (80) 

Since  it  holds  for  m  =  1,  by  induction  it  holds  for  all  m.  Now  assume  that  f(nin)  = 
/(n)/(m)  holds  for  some  m.  Then 

f(n(m  +  1))  =  f(nm  +  n)  =  \\(f(nm)rf(n))\\  = /(n)||(/(m),l)||  = /(n)/(m  +  1), 

(81) 

and  therefore  by  induction  this  is  true  for  all  m,  as  it  obviously  holds  for  m  =  1.  This 
implies  that  f{nk )  =  fk{n). 

Now  let  m,  n  >  2  be  some  fixed  integers,  and  h  the  integer  such  that  for  any  positive 
integer  k 

mh  <  nk  <  mh+1.  (82) 

Applying  /(•)  to  these  numbers,  it  follows  that 


h\ogf{m)  <k\ogf{n)  <  {h  +  1) log/(m),  (83) 

and  using  the  fact  that  h  <  k^fff  and  h  >  k^fffi  —  1  we  have  that 

log /(»0  _  !og/(»0  <  log /(»)  <  log /(»0  +  log  f(m)  (g .  v 

log  m  k  log  n  log  n  ~  log  m  k  log  n 

Taking  the  limit  of  k  going  to  infinity  lets  us  conclude  that 


log/(»0  =  log /(») 

log  in  log  n 


(85) 


which  means  that  this  fraction  is  a  constant  independent  of  n  (and  different  than  0  as 
/( 2)  >  1).  Calling  this  constant  1/ p,  we  conclude  that 


/(h)  =  nv. 


Now  for  any  rational  number  111/  n  we  have  that 

ll(Wn)ll  =  i|l(»,».)ll  =  i|l(/(«a/(>»'')ll  =  +  =  (1  +  (m/ny)k 
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so  by  homogeneity  ||(a,  fr)||  =  ( \a\P  +  \b\p)r  for  any  rationals  |fl|  and  \b\,  and  by  in¬ 
duction  for  any  vector  c  such  that  the  absolute  values  of  the  components  are  rational 
numbers 

llc||  =  (ElO'-  <8« 

i 

□ 

If  one  furthermore  assumes  some  regularity  condition,  then  the  result  is  valid  for 
any  complex  vector. 


B.  Reducing  sequential  games  to  simple  games 


Consider  the  sequential  game 

h  ri  \ 

G  =  I  c2  dx  si  I ,  (87) 

V  d.2  S2J 

where  in  the  worlds  with  outcome  1  reward  r\  is  given,  but  in  the  worlds  with  outcome 

2  the  subgame  H  =  (y  Sl  )  is  played.  We  want  to  reduce  it  to  a  simple  game  in  both 
\a2  S2J 

Kent's  universes  and  Many-Worlds. 

In  the  regular  Kent's  universe  C\  worlds  are  created  with  reward  r\,  c2di  worlds  are 
created  with  reward  Si,  and  c2d2  worlds  are  created  with  reward  s2,  so  it  seems  natural 
to  postulate  that  G  is  equivalent  to  the  simple  game 


Cl 

rM 

Cid] 

Si  • 

(88) 

c2d2 

S2  / 

In  the  reverse  Kent's  universe,  there  are  M(cj  +  c2)(di  +  d2)  worlds  in  the  beginning. 
After  the  first  branching,  Mci(A  +  d2)  worlds  are  imprinted  with  outcome  1,  and  the 
remaining  Mc2(di  +  d2)  worlds  are  split  again,  with  Me  2d]  being  imprinted  with  a 
further  outcome  1,  and  Mc2d2  with  outcome  2.  In  the  end  there  are  Me]  (d\  +  d2)  with 
reward  ri.  Me  2d]  worlds  with  reward  Si,  and  Me  2d  2  worlds  with  reward  s2,  so  it  seems 
natural  to  postulate  that  G  is  equivalent  to  the  simple  game 


c\{di  +  d2) 

cid\ 

si  • 

(89) 

c2d2 

S2/ 

Note  that  in  the  reverse  Kent's  universe  Substitution  is  satisfied: 
subgame  H  is 


V(H)  —  {d\Si  +d2s2), 

a  1  +  «2 


the  value  of  the 

(90) 
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and  the  value  of  G  there  is 


V(G)  -  (Cl  +  C2)(dl  +  dl)  (Cl(rfl  +  d2)ri  +  C2(dlSl  +  d 2S2))  (91) 

=  1  (ciri  +c2V(H))/  (92) 

which  is  equal  to  the  value  of  (  Cl 

VC2 

case,  as  that  is  quite  straightforward. 

In  Many-Worlds,  the  game  G  is  instantiated  by  making  a  measurement  on  the  state 
| xp)  =  Ci 1 1)  +  c2|2),  giving  reward  ri  in  the  worlds  with  outcome  1,  and  in  the  worlds 
with  outcome  2  doing  a  measurement  on  the  state  |  cp)  =  d\  |1)  +  d2|2),  finally  giving 
rewards  Si  and  s2  in  the  worlds  with  the  second  outcomes  1  and  2.  These  measure¬ 
ments  take  the  initial  state  |t p)  \M?)\q>)  \ D?)  to  the  final  state 

|G)  =Ci|l)|Mi)|<p)|D?)|n)+c2di|2)|M2)|l)|Di)|r2)+c2d2|2)|M2)|2)|D2)|r3).  (93) 

An  equivalent  way  to  play  this  game  is  to  make  a  joint  measurement  on  the  state 

\ip)  \(p)  =  C\d\\l)  |1)  +  Cid2|l)  1 2)  +  c2di|2)  |1)  +  c2d2 \2)  |2),  (94) 

but  in  the  worlds  where  the  measurement  on  |  xp)  resulted  in  1  apply  the  label  ?  to  both 
outcomes  of  the  measurement  on  |  cp),  leading  to  the  final  state 


r  1 


V(H) 


,  as  required.  We  shall  not  prove  the  general 


|G')  =Cidi|l)|Mi)|l)|D;)|n)+cid2|l)|Mi)|2)|D;/)ki) 

+  c2di|2)|M2)|l)|Di)|si)  +c2d2|2)|M2)|2)|D2)|s2)  (95) 

where  \D'?)  and  |D")  are  measurements  results  physically  distinct  from  |D?),  but  with 
the  same  label. 

If  one  does  not,  however,  coarse-grain  the  results  (1, 1)  and  (1,2)  together,  then  this 
measurement  procedure  can  be  regarded  as  playing  the  simple  game 


/  c\d\  rA 
c\d2  r\ 

c2d\  Si 

\c2d2  s2J 


(96) 


instead.  We  postulate  therefore  that  a  rational  agent  in  Many-Worlds  should  regard  G 
and  G'"  as  equivalent,  or  more  formally  that: 

•  Reduction  The  sequential  game 


/«i  n  \ 


G  = 


0Ln  pi  Si 


V 


Pm 


Sm) 


(97) 
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where  subgame  (fi,  s)  is  played  in  the  zvorlds  with  outcome  n,  has  the  same  value  as  the 
simple  game 


G' 


/  aii/3  r\  \ 

M-n— ?n—l 
Ot-nfi  1  Si 

KnPm— 1  Sm_i 

y  Sm  J 


(98) 


This  Reduction  postulate  suffices  to  prove  Substitution  as  a  theorem,  as  the  value  of 
the  subgame  ( /3,  s )  is 

-i  m 

V(M  =  «£lftl'  (99) 

and  the  value  of  G  is 


V(G)  = 


(  Nph  +  \cc„\PV(p,s)^, 


(100) 


(101) 


as  required. 


32 


